Localizing periodicity in near-field images 
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^ ' We show that Bayesian inference, hke that used in statistical mechanics, can guide the system- 

O . atic construction of Fourier dark-field methods for localizing periodicity in near-field (e.g. scanning- 

■ tunneling and electron-phase-contrast) images. For crystals in an aperiodic field, the Fourier coeffi- 

cient Ze^^ combines with a prior estimate for background amplitude B to predict background phase 
(/3) values distributed with a probability p{l3 — if \ Z, ip, B) inversely proportional to the amplitude 
P of the signal of interest, when this latter is treated as an unknown translation scaled to B. From 
UMStL-CME-90f26pf. 

O ■ 61.16.Di, 02.70.+d, 05.90.-fm 



I. INTRODUCTION 



Near-field images (defined here as images of wave amplitude or phase at the exit surface of a solid) with atomic 
■ (i.e. less than 2A) resolution are developments of the last quarter of this century. Transmission electron microscopes 
^ ' capable of delivering phase-contrast images with continuous transfer to spatial frequencies beyond 1 / (2A) have become 
I available in the last decade,EJ and the first scanning tunneling3 and atomic-forceB microscopes able to resolve atoms 
have been created in this period as well. More recently, near-field visible-light microscopes with resolutions much 



below the wavelength Jiimit normally associated with light microscopes, although not with atomic resolution, have 
been described as wellD 



As viewed from frequency space, near-field images with atomic resolution may contain data of three basic types. The 
first type, which we refer to as the "diffraction data", are simply the data on lattice periodicity amplitudes contained in 
image power spectra. All researchers accustomed to obtaining data on lattice parameters or orientation from diffraction 
I patterns have experience with this information. Near-field images, sometimes under a more restricted set of conditions, 
' can also contain "phase- information" on the phase-lag from one periodicity to the next. Diffractionists involved in 
CO . structure determination will recognize that such information is needed, along with diffraction data, to determine the 
• distribution of scattering density within unit cells. Finally, such images also contain "darkfield information" , which 
we define here as information on amplitude and phase differences across the breadth of individual diffracted beams. 
This information tells how near-by periodicities interfere via "beat" processes, and hence how the intensity of any 
given range of periodicities is distributed throughout the region examined. In other words, it tells where crystals and 
"j^ r their boundaries are located in the image field. Researchers involved in small-angle and anomalous Bragg-scattering 
' studies, as well as in diffraction imaging techniques (like x-ray topography or weak-beam electron imaging), all use 
I . this sort of information. Electron and x-ray dark-field techniques which use far-field diffraction contrast to provide 
'■^ ' data on both phase and amplitude components of this information, in particular, have served for decades as powerful 
^ [ tools for the study of defects in (and boundaries around) crystals.Q 

O ' Although the darkfield data in high-resolution near-field images are of demonstrable interest for complementing 
that available via the far-field techniques discussed above, methods far extracting that information have generally 
taken the form of qualitative and ad hoc "recipes" for image processing.^ As one consequence of the large and growing 
amount of data in individual images (e.g., tens of megabytes available in individual electron-phase-contrast negatives), 
formal techniques of physical inference (like the theory of accessible state probabilities used in statistical mechanics) 
d ' can facilitate more rigorous and quantitative study. In this paper, we outline a straijegy for taking steps in this 
regard. With Bayes' theorem as our prescription for physical inference from new data,Ll we first tackle the problem 
of extracting darkfield information on periodic structures buried in an otherwise aperiodic field. This problem is 
common to electron-phase-contrast and air-based scanning-tunneling images, even of purely crystalline fields, because 
specimen preparation and system instabilities, respectively, often give rise to a superposed aperiodic background. 
Second, we discuss how the strategy may be extended to treat a wider class of problems as well. 
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II. THEORY 



Mathematically, suppose we are given an experimental image 2;[a:,?;] whose Fom'ier transform is Z[it, w], and that 
the objective is to construct a darkfield image of selected regions of u,v space associated with one or more peaks in 
the power spectrum Z'^[u, v]. Although we will neglect contributions to the darkfield image from regions of u,v space 
outside of those selected, in the "peak regions" we must separate the Z[u,?;] coefficients into peak P and background 
B components such that P[w, u] = Z[m, u] — B[u,?;]. The "darkfield image" is then just the inverse transform of P, 
namely p[x, y] = z[x, y] — b[x, y]. Here bold print denotes a complex number. We have assumed also that the starting 
image is real, and that P and B as chosen retain the conjugate symmetry of Z (i.e. Z[—u,—v] = Z*[u,v]) so that 
their inverse transforms ^re real as well. 

In order to separate Z into peak and background components, information on the nature of the background is 
crucial. Previously proposed methods make ad hoc assumptions about B[w, u]. For example, the traditional "window 
method" assumes that B[u, v] is zero inside the peak regions. The usefulness by comparison of taking explicit account 
of estimates for the background amplitude beneath peaks was recently demonstrated by O'Keefe and Sattler, but 
only ad-Jioc assumptions (such as a uniformly random distribution) were proposed for the assignment of background 
phases.l3 If, however, information on background phase is taken into account systematically, noise in the calculations 
can be reduced further, and the background-subtraction recipe can serve as a basis for a plethora of strategies for 
direct physical inference. 

Specifically, if for any value of [u, v] we consider calculation of the posterior probability p{f3 \ Z, B)df3 of the back- 
ground phase /3, given the measured coefficient 7i=Ze^^ and in addition a prior information value for the background 
amplitude B, then Bayes' theorem gives 

p{(3 I Z, ^, B)df3 ^ p{p I Z, B)d f^^ ' ^f' ^^ . (1) 

p{ip I Z, B) 

Equation (|l|) tells us how to modify our estimate of the background phase (3 in light of data on the phase 1^9 of Z = P-l-B 
at that frequency. In notational terms, p{j3 \ Z, B)d(3 is the prior probability of P given Z and B; p{ip \ Z, B, (3)dip is 
the likelihood function (or sampling distribution) probability that ip will lie between ip and ip + dip for given values of 
Z, B, and f3; and p{ip \ Z, B)dip is the a priori probability of Lp given Z and B. 

The prior p{Lp \ Z, B)dip will of course have no /3 dependence, since it can be considered an integral over all 
(3. Determination of such /3-independent terms can be considered academic here, since p{P \ Z, B)dj3 obeys the 
normalization condition 

p[(i\Z,B)dp = l. (2) 

The (3 dependence of p(/3 | Z, B)d/3, one of the fundamental priors, will be held for discussion below, the /3 dependence 
of p{ip I Z, B, (3)dip, on the other hand, is less fundamental, and is complicated by the fact that Z is a composite of 
two physically distinct quantities (P and B). Fortunately, we can replace this ip probability with the product of a 
probability for a, the peak phase angle, and a geometric term. This is done by writing the peak phase angle a in 
terms of Z, tp, B, and (3, and then changing differential variables in the normalization equation for p(a \ Z,B,f3)da, 
to get 



p{ip\Z,B,P)dip^p{a\Z,B,P) 



da 



d(p 



dip. (3) 



Here the second factor on the right-hand side is the absolute value of the Jacobian (functional determinant) for the 
variable change. 

To further eliminate Z dependences, note from Bayes' theorem that 

p(a|Z,i3,/?)da=p(a|B,/3)da^^^^^. (4) 

As above, a variable change (here from Z to P) then allows us to write 

dP 



p{Z I a, B, l3)dZ = p{P I a, B, [3) 



dZ 



dZ. (5) 
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The partials in (||) and (||) are easily calculated, and hence these expressions allow us to evaluate the probability 
p(/3 I Z,ip, B)d(3 for any assignment of the peak and background prior probabilities p{/3 \ Z,B)d(3, p{a \ B,l3)da, 
p{P I a,B,(3)dP. 

To evaluate the partials, first note from Fig. 1 that Z can be written in terms of P, a, B, and P as 



Z = |(Pcos H +Pcos [/3]f + (Psin[a] +Psin[/3])^|' , 

and that a can be related to the variables Z, cp, B, and /3 via the expressions 

B sin [P — ip] 



(f — a ~ arctan 



(6) 



(7) 



B cos [P-p]- Z 
BcoslP -ip] - Z' 
P2 _ 2ZB + Z2 

At this point, it is convenient to introduce the following notation for signed angles in the triangle formed by Z, P, 
and B: 9B = (y5 — a, 9p = /3 — 93, and Qz = a — P ~ 7rxsgn[a — /3], where sgn[a;] ={+1 for a; > 0, — 1 for x < 0}. 
Note also the mathematical equivalence oi Qz and 85 under exchange of Z and B. 

By taking the derivative of (||) with respect to P at constant a, B, and P, it is easy to show that (dZ/dP)^ b p ~ 

Z • P/ZP — cos [6s], and hence ]dP/dZ\^ b p — \^^^ [©-b]I- By differentiating (0) with respect to ip at constant Z, 
B, and P, it likewise follows that 



(8) 



da 




1 - 


- {B/Z) cos [P-ip] 


dip 


Z,B,p 


[B/Z? 


-2{B/Z) cos [P^ip] + 1 



_ |cos [6b] I 
~ P/B 

Note, therefore, that the product of the two partials is simply B/P. These facts in hand, we can turn to assignment 
for the prior probabihties p{P \ Z, B)dp, p{a \ B, P)da, and p{P \ a, B, P)dP mentioned above. 

For the frequent times when prior information singles out no preferred direction, uniform (isotropic) priors for both 
P and a [i.e., p{P \ Z,B) = p{a \ B,P) = l/27r] will be appropriate. In any case, these priors do not enter into other 
potentially /3-dependent terms in Eqs. (]l|)-(H). Hence, should we assign them an explicit /3-dependence it will be 
factored into p{P \ Z, ip, 0) directly. 

The most complicated prior is p(P | a,B,P), because it cannot be dismissed by isotropy, and because it enters 
also into the only term whose /9-dependence remains undiscussed, namely p{Z \ B,P) in Eq.(^. We point out here 
two choices of potential interest. A prior for P which (i) appears simple to implement, (ii) results in correlations as 
expected between the direction of B and Z when B fa Z , but (iii) has unclear physical meaning at best is that which 
makes p{a \ Z. P, /?) in Eq. (^) a constant. The probability distribution for /? resulting from such a prior is plotted 
in Fig. 2. To generate pseudorandom values for P with this distribution, one can simply choose values of 6^ (hence 
ot) uniformly distributed in its allowed range for given ZjB, and then calculate P — Lp = 6p from Qz allowing for 
the fact that 6p is a double- valued function of Qz with 50% of its values on each branch when Z < B. Numerical 
simulations in our laboratory have verified that a pseudorandom-number generator so constructed reproduces the 
distribution shown in Fig. 2. 

The physically (if not computationally) most helpful assignment for p{P \ a, B, P) is likely to be the uniform (/?- 
independent) assignment. If we view P not as a scale factor but as the magnitude of a vector translation whose length 
is scaled to the value of P, then p{P \ a, P, P) = const — 1/Pmax for Pmax ^ r^max niay be considered the correct 
transformation-invariant prior probability in the absence of further inforniation.Ll In this case, 



p{Z\B,P) 



P 



max Jo 



[see [eB]\dez- 



(9) 



The argument in this integral is double- valued and must therefore be handled carefully for 65 < Qz, but it can 
nevertheless be written as a function of Z, P, and Qz only. Hence the integral itself is independent of p. Given this 
uniform prior for P, and isotropic priors for a and /?, Eqs. (|l|)-(||) therefore give a /3-dependence for p{P \ Z, ip, B) 
proportional to the product of partials, and inversely proportional to P. This distribution is plotted for various Z/ B 
in Fig. 3. Numerical simulations in our laboratory have confirmed that P vectors with a uniform distribution of 
lengths and orientation angles yield /3-values with the distribution shown. 
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III. SUMMARY 



The Bayesian phase model here does two things. First of ah, it takes the image-processing recipe for "background 
subtraction" proposed by O'Keefe and Sattler and creates a recipe for physical inference by replacing ad hoc assign- 
ments of background phase with an assignment of phases based on physical assumption. Second, it systematically 
minimizes noise in darkfield images associated with crystallite fine structure (e.g., boundaries), since the information 
on fine structure is found in outlying regions of the Bragg peaks where Z/ B is near 1. These are the regions where 
errors associated with an ad hoc phase assignment will have their greatest effect. Because the image itself, not the 
imaging process, constitutes the object of tiie physics in this analysis, it can be applied to images inferred via Bayesian 
statistical analysis of the imaging processt^l as well as to raw experimental data. 

Strategies for obtaining prior information on the amplitude of B in these appli|eations begin with simple extrapo- 
lation of an azimuthally symmetric "cloud" of aperiodic contrast in Fourier space.til Applications of Fourier darkfield 
analysis which are now limited-by resolution and signal-to-noise ratio include the study of isotopically anomalous nm- 
sized diamonds in metcorites,li3 and the study of objects showing space-group disallowed symmetries due to twinning 
and quasicrystallinity. The algorithms here facilitate removal of aperiodic background from selected frequency-space 
regioiia,in images of these structures. Some applications, such as the study of icosahedral structures in periodic 
array,E3 may require removal of periodic background. These involve prior information which goes beyond the scope 
of the result, but not the formalism, discussed here. 

The formalism is also not limited to prior information on background amplitude. Prior information on the variance 
in background amplitude (here assumed to be zero) is a logical addition. When it can be made a practical addition 
remains to be seen. The formalism should also allow us to estimate the strength of periodicities (analogous to 
diffracted beam intensities) from point to point in the field. This is something that diffraction darkfield does which 
Fourier darkfield does less directly. However, the strategy posed here for physical inference from images allows this 
and other questions to be asked mathematically. As another example, diffraction darkfield imaging does not allow us 
to distinguish between crystals with the same periodicity but a different periodicity phase. Atomic-resolution images 
with data on periodicity phase and amplitude contain the needed data, and the strategy proposed here suggests a 
formalism for posing this question as well. 
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FIG. 1. Given complex Fourier coefficients obeying Ze*''' = Pe'" + i?e''^, this schematic illustrates the geometric relationship 
between peak phase angle a, and background phase angle /?, when Z, ip, and B are specified. 

FIG. 2. A plot of p(/3 I Z, if, B) in probability per radian interval vs. (3 — ip in radians for the computationally convenient 
case when p{(3 \ Z,B) = 1/2it, and p{a \ Z, B, fi) is independent of [3. Note the strong bias toward /3 w as Z/B approaches 
1, and the cusps for Z/B < I. An algorithm for generating /? values with this distribution is described in the text. Notation 
for Z/B: solid line, |; plus, i; cross, 2; diamond, 8. 



FIG. 3. A plot of p(l3 I Z, if, B) in probability per radian interval vs. (3 — ifi in radians for the physically realistic case when 
p(/3 \ Z,B) — p{a \ B,[3) = 1/2-k, and p{P \ a,B,/3) = const = 1/Pmax, where Pmax 3> Zma^- Note the bias toward f3 ^ (p as 
Z/B approaches 1. The bias will weaken with increasing variance in B when nonzero variances are included as prior information 
in the calculation. Notation for Z/B: solid line, {512, -^y; plus, {8, |}; cross, {2, i}; diamond, {2^/^, 2~^/^}. 
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